Goto

Collaborating Authors

 nano banana


ChatGPT image generation is now faster and better at following tweaks

Engadget

Plus, there's a new dedicated sidebar with presets and prompt suggestions. ChatGPT is available on both iOS and Android. Following the release of GPT-5.2 last week, OpenAI has begun rolling out a new image generation model . The company says the updated ChatGPT Images is four times faster than its predecessor. If you're a frequent ChatGPT user, you'll know it can sometimes take a while for OpenAI's servers to create images, particularly during peak times and if you're not paying for ChatGPT Plus.


Google's AI Nano Banana Pro accused of generating racialised 'white saviour' visuals

The Guardian

The logos of organisations were also included in images generated by Google's Nano Banana Pro AI tool. The logos of organisations were also included in images generated by Google's Nano Banana Pro AI tool. Google's AI Nano Banana Pro accused of generating racialised'white saviour' visuals Nano Banana Pro, Google's new AI-powered image generator, has been accused of creating racialised and "white saviour" visuals in response to prompts about humanitarian aid in Africa - and sometimes appends the logos of large charities. Asking the tool tens of times to generate an image for the prompt "volunteer helps children in Africa" yielded, with two exceptions, a picture of a white woman surrounded by Black children, often with grass-roofed huts in the background. In several of these images, the woman wore a T-shirt emblazoned with the phrase "Worldwide Vision", and with the UK charity World Vision's logo.


Hands On With Google's Nano Banana Pro Image Generator

WIRED

Google's latest AI image model is vastly better than the previous release at generating text in images. You can expect companies to go buck wild with this update. Nano Banana Pro generated this image, assembling a crowd of standalone characters into one scene. Corporate AI slop feels inescapable in 2025. From website banner ads to outdoor billboards, images generated by businesses using AI tools surround me.


EdiVal-Agent: An Object-Centric Framework for Automated, Fine-Grained Evaluation of Multi-Turn Editing

Chen, Tianyu, Zhang, Yasi, Zhang, Zhi, Yu, Peiyu, Wang, Shu, Wang, Zhendong, Lin, Kevin, Wang, Xiaofei, Yang, Zhengyuan, Li, Linjie, Lin, Chung-Ching, Xie, Jianwen, Leong, Oscar, Wang, Lijuan, Wu, Ying Nian, Zhou, Mingyuan

arXiv.org Artificial Intelligence

Instruction-based image editing has advanced rapidly, yet reliable and interpretable evaluation remains a bottleneck. Current protocols either (i) depend on paired reference images-resulting in limited coverage and inheriting biases from prior generative models-or (ii) rely solely on zero-shot vision-language models (VLMs), whose prompt-based assessments of instruction following, content consistency, and visual quality are often imprecise. To address this, we introduce EdiVal-Agent, an automated and fine-grained evaluation framework grounded in an object-centric perspective, designed to assess not only standard single-turn but also multi-turn instruction-based editing with precision. Given an input image, EdiVal-Agent first decomposes it into semantically meaningful objects, then synthesizes diverse, context-aware editing instructions while dynamically updating object pools across turns. These two stages enable two novel object-centric metrics tailored for multi-turn evaluation and one global metric of visual quality: (1) EdiVal-IF, which measures instruction following by combining open-vocabulary object detectors for symbolic checks with VLMs for semantic verification on detector-guided crops; (2) EdiVal-CC, which evaluates content consistency by calculating semantic similarity of unchanged objects and background using the evolving object pools; and (3) EdiVal-VQ, which quantifies changes in overall visual quality with human preference models. Instantiating this pipeline, we build EdiVal-Bench, a multi-turn editing benchmark covering 9 instruction types and 13 state-of-the-art editing models spanning in-context, flow-matching, and diffusion paradigms. We demonstrate that EdiVal-Agent can be used to identify existing failure modes, thereby informing the development of the next generation of editing models.


Nvidia CEO Jensen Huang Is Bananas for Google Gemini's AI Image Generator

WIRED

Nvidia CEO Jensen Huang Is Bananas for Google Gemini's AI Image Generator The Nvidia CEO reveals his consuming love for Google's image generator, the artsy side of Grok, and what exactly he uses Perplexity, Gemini, and ChatGPT for right now. Nvidia CEO Jensen Huang is in London, standing in front of a room full of journalists, outing himself as a huge fan of Gemini's Nano Banana . "How could anyone not love Nano Banana? I mean Nano Banana, how good is that? Tell me it's not true!" "Tell me it's not true! I was just talking to Demis [Hassabis, CEO of DeepMind ] yesterday and I said'How about that Nano Banana! It looks like lots of people agree with him: The popularity of the Nano Banana AI image generator--which launched in August and allows users to make precise edits to AI images while preserving the quality of faces, animals, or other objects in the background--has caused a 300 million image surge for Gemini in the first few days in September already, according to a post on X by Josh Woodward, VP of Google Labs and Google Gemini. Huang, whose company was among a cohort of big US technology companies to announce investments into data centers, supercomputers, and AI research in the UK on Tuesday, is on a high. Speaking ahead of a white-tie event with UK prime minister Keir Starmer (where he plans to wear custom black leather tails), he's boisterously optimistic about the future of AI in the UK, saying the country is "too humble" about the country's potential for AI advancements. He cites the UK's pedigree in themes as wide as the industrial revolution, steam trains, DeepMind (now owned by Google), and university researchers, as well as other tangential skills. "No one fries food better than you do," he quips. Nvidia announced a $683 million equity investment in datacenter builder Nscale this week, a move that--alongside investments from OpenAI and Microsoft--has propelled the company to the epicenter of this AI push in the UK. Huang estimates that Nscale will generate more than $68 billion in revenues over six years. "I'll go on record to say I'm the best thing that's ever happened to him," he says, referring to Nscale CEO Josh Payne. "As AI services get deployed--I'm sure that all of you use it.